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<110> TAKAKURA, Hikaru 
MORISHITA, Mio 
YAMAMOTO, Katsuhiko 
MITTA, Masanori 
ASADA, Kiyozo 
TSUNASAWA, Susumu 
KATO, Ikunoshin 

<120> ULTRATHERMOS TABLE PROTEASE GENES 

<130> T AKAKU RA= 1 A 

<140> 09/841,553 
<141> 2001-04-24 

<150> 08/894,818 
<151> 1997-08-29 

<150> JP32385/1995 
<151> 1995-12-12 

<150> JP96/03253 
<151> 1996-11-07 

<160> 45 

<170> Patentln version 3.2 

<210> 1 
<211> 659 
<212> PRT 

<213> Thermococcus celer 
<400> 1 

Met Lys Arg Leu Gly Ala Val Val Leu Ala Leu Val Leu Val Gly Leu 
15 10 15 



Leu Ala Gly Thr Ala Leu Ala Ala Pro Val Lys Pro Val Val Arg Asn 
20 25 " 30 



Asn Ala Val Gin Gin Lys Asn Tyr Gly Leu Leu Thr Pro Gly Leu Phe 
35 40 45 



Lys Lys Val Gin Arg Met Asn Trp Asn Gin Glu Val Asp Thr Val He 
50 55 60 



Met Phe Gly Ser Tyr Gly Asp Arg Asp Arg Ala Val Lys Val Leu Arg 
65 70 75 80 



Leu Met Gly Ala Gin Val Lys Tyr Ser Tyr Lys He He Pro Ala Val 
85 90 95 



Ala Val Lys lie Lys Ala Arg Asp Leu Leu Leu lie Ala Gly Met lie 
100 105 110 



Asp Thr Gly Tyr Phe Gly Asn Thr Arg Val Ser Gly He Lys Phe He 
115 120 125 



Gin Glu Asp Tyr Lys Val Gin Val Asp Asp Ala Thr Ser Val Ser Gin 
130 135 140 



He Gly Ala Asp Thr Val Trp Asn Ser Leu Gly Tyr Asp Gly Ser Gly 
145 150 155 " ~ 160 



Val Val Val Ala He Val Asp Thr Gly He Asp Ala Asn His Pro Asp 
165 170 175 



Leu Lys Gly Lys Val He Gly Trp Tyr Asp Ala Val Asn Gly Arg Ser 
180 185 190 



Thr Pro Tyr Asp Asp Gin Gly His Gly Thr His Val Ala Gly He Val 
195 200 205 



Ala Gly Thr Gly Ser Val Asn Ser Gin Tyr He Gly Val Ala Pro Gly 
210 215 220 



Ala Lys Leu Val Gly Val Lys Val Leu Gly Ala Asp Gly Ser Gly Ser 
225 230 235 ~ ' 240 



Val Ser Thr He He Ala Gly Val Asp Trp Val Val Gin Asn Lys Asp 
245 250 255 



Lys Tyr Gly He Arg Val He Asn Leu Ser Leu Gly Ser Ser Gin Ser 
260 265 270 



Ser Asp Gly Thr Asp Ser Leu Ser Gin Ala Val Asn Asn Ala Trp Asp 
275 280 285 



Ala Gly He Val Val Cys Val Ala Ala Gly Asn Ser Gly Pro Asn Thr 
290 295 300 



Tyr Thr Val Gly Ser Pro Ala Ala Ala Ser Lys Val He Thr Val Gly 
305 310 315 320 
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Ala Val Asp Ser Asn Asp Asn lie Ala Ser Phe Ser Ser Arg Gly Pro 
325 330 335 



Thr Ala Asp Gly Arg Leu Lys Pro Glu Val Val Ala Pro Gly Val Asp 
340 345 350 



lie lie Ala Pro Arg Ala Ser Gly Thr Ser Met Gly Thr Pro lie Asn 
355 360 365 



Asp Tyr Tyr Thr Lys Ala Ser Gly Thr Ser Met Ala Thr Pro His Val 
370 375 380 



Ser Gly Val Gly Ala Leu lie Leu Gin Ala His Pro Ser Trp Thr Pro 
385 390 395 " 400 



Asp Lys Val Lys Thr Ala Leu lie Glu Thr Ala Asp lie Val Ala Pro 
405 410 415 



Lys Glu lie Ala Asp lie Ala Tyr Gly Ala Gly Arg Val Asn Val Tyr 
420 425 430 



Lys Ala lie Lys Tyr Asp Asp Tyr Ala Lys Leu Thr Phe Thr Gly Ser 
435 440 445 



Val Ala Asp Lys Gly Ser Ala Thr His Thr Phe Asp Val Ser Gly Ala 
450 455 460 



Thr Phe Val Thr Ala Thr Leu Tyr Trp Asp Thr Gly Ser Ser Asp lie 
465 470 475 480 



Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Glu Val Asp Tyr Ser Tyr 
485 490 1 495 



Thr Ala Tyr Tyr Gly Phe Glu Lys Val Gly Tyr Tyr Asn Pro Thr Ala 
500 505 510 



Gly Thr Trp Thr Val Lys Val Val Ser Tyr Lys Gly Ala Ala Asn Tyr 
515 520 525 



Gin Val Asp Val Val Ser Asp Gly Ser Leu Ser Gin Ser Gly Gly Gly 
530 535 540 



Asn Pro Asn Pro Asn Pro Asn Pro Asn Pro Thr Pro Thr Thr Asp Thr 
545 550 555 560 
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Gin Thr Phe Thr Gly Ser Val Asn Asp Tyr Trp Asp Thr Ser Asp Thr 
565 570 * 575 



Phe Thr Met Asn Val Asn Ser Gly Ala Thr Lys lie Thr Gly Asp Leu 
580 585 590 



Thr Phe Asp Thr Ser Tyr Asn Asp Leu Asp Leu Tyr Leu Tyr Asp Pro 
595 600 605 



Asn Gly Asn Leu Val Asp Arg Ser Thr Ser Ser Asn Ser Tyr Glu His 
610 615 620 



Val Glu Tyr Ala Asn Pro Ala Pro Gly Thr Trp Thr Phe Leu Val Tyr 
625 630 635 640 



Ala Tyr Ser Thr Tyr Gly Trp Ala Asp Tyr Gin Leu Lys Ala Val Val 
645 650 655 



Tyr Tyr Gly 



<210> 2 

<211> 1977 

<212> DNA 

<213> Thermococcus celer 

<400> 2 



atgaagaggt 


taggtgctgt 


ggtgctggca 


ctggtgctcg 


tgggtcttct 


ggccggaacg 


60 


gcccttgcgg 


cacccgtaaa 


accggttgtc 


aggaacaacg 


cggttcagca 


gaagaactac 


120 


ggactgctga 


ccccgggact 


gttcaagaaa 


gtccagagga 


tgaactggaa 


ccaggaagtg 


180 


gacaccgtca 


taatgttcgg 


gagctacgga 


gacagggaca 


gggcggttaa 


ggtactgagg 


240 


ctcatgggcg 


cccaggtcaa 


gtactcctac 


aagataatcc 


ctgctgtcgc 


ggttaaaata 


300 


aaggccaggg 


accttctgct 


gatcgcgggc 


atgatagaca 


cgggttactt 


cggtaacaca 


3bU 


agggtctcgg 


gcataaagtt 


catacaggag 


gattacaagg 


ttcaggttga 


cgacgccact 


420 


tccgtctccc 


agataggggc 


cgataccgtc 


tggaactccc 


tcggctacga 


cggaagcggt 


480 


gtggtggttg 


ccatcgtcga 


tacgggtata 


gacgcgaacc 


accccgatct 


gaagggcaag 


540 


gtcataggct 


ggtacgacgc 


cgtcaacggc 


aggtcgaccc 


cctacgatga 


ccagggacac 


600 


ggaacccacg 


ttgcgggtat 


cgttgccgga 


accggcagcg 


ttaactccca 


gtacataggc 


660 
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gtcgcccccg 


gcgcgaagct 


cgtcggcgtc 


aaggttctcg 


gtgccgacgg 


ttcgggaagc 


720 


gtctccacca 


tcatcgcggg 


tgttgactgg 


gtcgtccaga 


acaaggacaa 


gtacgggata 


780 


agggtcatca 


acctctccct 


cggctcctcc 


cagagctccg 


acggaaccga 


ctccctcagt 


840 


caggccgtca 


acaacgcctg 


ggacgccggt 


atagtagtct 


gcgtcgccgc 


cggcaacagc 


900 


gggccgaaca 


cctacaccgt 


cggctcaccc 


gccgccgcga 


gcaaggtcat 


aaccgtcggt 


960 


gcagttgaca 


gcaacgacaa 


catcgccagc 


ttctccagca 


ggggaccgac 


cgcggacgga 


1020 


aggctcaagc 


cggaagtcgt 


cgcccccggc 


gttgacatca 


tagccccgcg 


cgccagcgga 


1080 


accagcatgg 


gcaccccgat 


aaacgactac 


tacaccaagg 


cctctggaac 


cagcatggcc 


1140 


accccgcacg 


tttcgggcgt 


tggcgcgctc 


atcctccagg 


cccacccgag 


ctggaccccg 


1200 


gacaaggtga 


agaccgccct 


catcgagacc 


gccgacatag 


tcgcccccaa 


ggagatagcg 


1260 


gacatcgcct 


acggtgcggg 


tagggtgaac 


gtctacaagg 


ccatcaagta 


cgacgactac 


1320 


gccaagctca 


ccttcaccgg 


ctccgtcgcc 


gacaagggaa 


gcgccaccca 


caccttcgac • 


1380 


gtcagcggcg 


ccaccttcgt 


gaccgccacc 


ctctactggg 


acacgggctc 


gagcgacatc 


1440 


gacctctacc 


tctacgaccc 


caacgggaac 


gaggttgact 


actcctacac 


cgcctactac 


1500 


ggcttcgaga 


aggtcggcta 


ctacaacccg 


accgccggaa 


cctggacggt 


caaggtcgtc 


1560 


agctacaagg 


gcgcggcgaa 


ctaccaggtc 


gacgtcgtca 


gcgacgggag 


cctcagccag 


1620 


tccggcggcg 


gcaacccgaa 


tccaaacccc 


aacccgaacc 


caaccccgac 


caccgacacc 


1680 


cagaccttca 


ccggttccgt 


taacgactac 


tgggacacca 


gcgacacctt 


caccatgaac 


1740 


gtcaacagcg 


gtgccaccaa 


gataaccggt 


gacctgacct 


tcgatacttc 


ctacaacgac 


1800 


ctcgacctct 


acctctacga 


ccccaacggc 


aacctcgttg 


acaggtccac 


gtcgagcaac 


1860 


agctacgagc 


acgtcgagta 


cgccaacccc 


gccccgggaa 


cctggacgtt 


cctcgtctac 


1920 


gcctacagca 


cctacggctg 


ggcggactac 


cagctcaagg 


ccgtcgtcta 


ctacggg 


1977 



<210> 3 

<211> 522 

<212> PRT 

<213> Pyrococcus furiosus 
<220> 

<221> misc_f eature 

<222> (428) . . (428) 

<223> Xaa is Gly or Val 

<400> 3 
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Ala Glu Leu Glu Gly Leu Asp Glu Ser Ala Ala Gin Val Met Ala Thr 
15 10 15 



Tyr Val Trp Asn Leu Gly Tyr Asp Gly Ser Gly lie Thr lie Gly lie 
20 25 30 



lie Asp Thr Gly lie Asp Ala Ser His Pro Asp Leu Gin Gly Lys Val 
35 " 40 45 



lie Gly Trp Val Asp Phe Val Asn Gly Arg Ser Tyr Pro Tyr Asp Asp 
50 55 60 



His Gly His Gly Thr His Val Ala Ser He Ala Ala Gly Thr Gly Ala 
65 70 75 80 



Ala Ser Asn Gly Lys Tyr Lys Gly Met Ala Pro Gly Ala Lys Leu Ala 
85 90 95 



Gly He Lys Val Leu Gly Ala Asp Gly Ser Gly Ser He Ser Thr He 
100 105 110 



He Lys Gly Val Glu Trp Ala Val Asp Asn Lys Asp Lys Tyr Gly He 
115 120 ^ * 125 



Lys Val He Asn Leu Ser Leu Gly Ser Ser Gin Ser Ser Asp Gly Thr 
130 135 140 



Asp Ala Leu Ser Gin Ala Val Asn Ala Ala Trp Asp Ala Gly Leu Val 
145 150 155 160 



Val Val Val Ala Ala Gly Asn Ser Gly Pro Asn Lys Tyr Thr He Gly 
165 170 175 



Ser Pro Ala Ala Ala Ser Lys Val He Thr Val Gly Ala Val Asp Lys 
180 185 190 



Tyr Asp Val lie Thr Ser Phe Ser Ser Arg Gly Pro Thr Ala Asp Gly 
195 200 205 



Arg Leu Lys Pro Glu Val Val Ala Pro Gly Asn Trp lie Tie Ala Ala 
210 215 220 



Arg Ala Ser Gly Thr Ser Met Gly Gin Pro He Asn Asp Tyr Tyr Thr 
225 230 235 240 
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Ala Ala Pro Gly Thr Ser Met Ala Thr Pro His Val Ala Gly lie Ala 
245 250 255 



Ala Leu Leu Leu Gin Ala His Pro Ser Trp Thr Pro Asp Lys Val Lys 
260 265 * 270 



Thr Ala Leu lie Glu Thr Ala Asp lie Val Lys Pro Asp Glu lie Ala 
275 280 285 



Asp lie Ala Tyr Gly Ala Gly Arg Val Asn Ala Tyr Lys Ala lie Asn 
290 295 300 



Tyr Asp Asn Tyr Ala Lys Leu Val Phe Thr Gly Tyr Val Ala Asn Lys 
305 310 315 320 



Gly Ser Gin Thr His Gin Phe Val lie Ser Gly Ala Ser Phe Val Thr 
325 330 335 



Ala Thr Leu Tyr Trp Asp Asn Ala Asn Ser Asp Leu Asp Leu Tyr Leu 
340 345 350 



Tyr Asp Pro Asn Gly Asn Gin Val Asp Tyr Ser Tyr Thr Ala Tyr Tyr 
355 360 "* 365 



Gly Phe Glu Lys Val Gly Tyr Tyr Asn Pro Thr Asp Gly Thr Trp Thr 
370 375 380 



lie Lys Val Val Ser Tyr Ser Gly Ser Ala Asn Tyr Gin Val Asp Val 
385 390 395 400 



Val Ser Asp Gly Ser Leu Ser Gin Pro Gly Ser Ser Pro Ser Pro Gin 
405 410 415 



Pro Glu Pro Thr Val Asp Ala Lys Thr P he Gin Xaa Ser Asp His T yr 
420 425 430 



Tyr Tyr Asp Arg Ser Asp Thr Phe Thr Met Thr Val Asn Ser Gly Ala 
435 440 445 



Thr Lys lie Thr Gly Asp Leu Val Phe Asp Thr Ser Tyr His Asp Leu 
450 455 460 
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Asp Leu Tyr Leu Tyr Asp Pro Asn Gin Lys Leu Val Asp Arg Ser Glu 
465 470 475 480 



Ser Pro Asn Ser Tyr Glu His Val Glu Tyr Leu Thr Pro Ala Pro Gly 
485 490 495 



Thr Trp Tyr Phe Leu Val Tyr Ala Tyr Tyr Thr Tyr Gly Trp Ala Tyr 
500 505 ^ 510 



Tyr Glu Leu Thr Ala Lys Val Tyr Tyr Gly 
515 520 



<210> 4 

<211> 1566 

<212> DNA 

<213> Pyrococcus furiosus 



<220> 

<221> mis cofeature 

<222> (1283) . . (1283) 

<223> n is G or T 

<400> 4 



gcagaattag 


aaggactgga 


tgagtctgca 


gctcaagtta 


tggcaactta 


cgtttggaac 


60 


ttgggatatg 


atggttctgg 


aatcacaata 


ggaataattg 


acactggaat 


tgacgcttct 


120 


catccagatc 


tccaaggaaa 


agtaattggg 


tgggtagatt 


ttgtcaatgg 


taggagttat 


180 


ccatacgatg 


accatggaca 


tggaactcat 


gtagcttcaa 


tagcagctgg 


tactggagca 


240 


gcaagtaatg 


gcaagtacaa 


gggaatggct 


ccaggagcta 


agctggcggg 


aattaaggtt 


300 


ctaggtgccg 


atggttctgg 


aagcatatct 


actataatta 


agggagttga 


gtgggccgtt 


360 


gataacaaag 


ataagtacgg 


aattaaggtc 


attaatcttt 


ctcttggttc 


aagccagagc 


420 


tcagatggta 


ctgacgctct 


aagtcaggct 


gttaatgcag 


cgtgggatgc 


tggattagtt 


480 


gttgtggttg 


ccgctggaaa 


cagtggacct 


aacaagtata 


caatcggttc 


tccagcagct 


540 


gcaagcaaag 


ttattacagt 


rggagccgrr 


gacaagtarg 


argttaraac 


aagcttctca 


— eotr 


agcagagggc 


caactgcaga 


cggcaggctt 


aagcctgagg 


ttgttgctcc. aggaaactgg 


660 


ataattgctg 


ccagagcaag 


tggaactagc 


atgggtcaac 


caattaatga 


ctattacaca 


720 


gcagctcctg 


ggacatcaat 


ggcaactcct 


cacgtagctg 


gtattgcagc 


cctcttgctc 


780 


caagcacacc 


cgagctggac 


tccagacaaa 


gtaaaaacag 


ccctcataga 


aactgctgat 


840 


atcgtaaagc 


cagatgaaat 


agccgatata 


gcctacggtg 


caggtagggt 


taatgcatac 


900 
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aaggctataa 


actacgataa 


ctatgcaaag 


ctagtgttca 


ctggatatgt 


tgccaacaaa 


960 


ggcagccaaa 


ctcaccagtt 


cgttattagc 


ggagcttcgt 


tcgtaactgc 


cacattatac 


1020 


tgggacaatg 


ccaatagcga 


ccttgatctt 


tacctctacg 


atcccaatgg 


aaaccaggtt 


1080 


gactactctt 


acaccgccta 


ctatggattc 


gaaaaggttg 


gttattacaa 


cccaactgat 


1140 


ggaacatgga 


caattaaggt 


tgtaagctac 


agcggaagtg 


caaactatca 


agtagatgtg 


1200 


gtaagtgatg 


gttccctttc 


acagcctgga 


agttcaccat 


ctccacaacc 


agaaccaaca 


1260 


gtagacgcaa 


agacgttcca 


agnatccgat 


cactactact 


atgacaggag 


cgacaccttt 


1320 


acaatgaccg 


ttaactctgg 


ggctacaaag 


attactggag 


acctagtgtt 


tgacacaagc 


1380 


taccatgatc 


ttgaccttta 


cctctacgat 


cctaaccaga 


agcttgtaga 


tagatcggag 


1440 


agtcccaaca 


gctacgaaca 


cgtagaatac 


ttaacccccg 


ccccaggaac 


ctggtacttc 


1500 


ctagtatatg 


cctactacac 


ttacggttgg 


gcttactacg 


agctgacggc 


taaagtttat 


1560 


tatggc 












1566 



<210> 5 
<211> 659 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 5 

Met Lys Gly Leu Lys Ala Leu lie Leu Val lie Leu Val Leu Gly Leu 
15 10 15 



Val Val Gly Ser Val Ala Ala Ala Pro Glu Lys Lys Val Glu Gin Val 
20 25 30 



Arg Asn Val Glu Lys Asn Tyr Gly Leu Leu Thr Pro Gly Leu Phe Arg 
35 4 0 4 5 



Lys lie Gin Lys Leu Asn Pro Asn Glu Glu lie Ser Thr Val lie Val 
50 55 60 



Phe Glu Asn His Arg Glu Lys Glu lie Ala Val Arg Val Leu Glu Leu 
65 70 75 80 



Met Gly Ala Lys Val Arg Tyr Val Tyr His He He Pro Ala He Ala 
85 90 95 
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Ala Asp Leu Lys Val Arg Asp Leu Leu Val lie Ser Gly Leu Thr Gly 
100 105 110 



Gly Lys Ala Lys Leu Ser Gly Val Arg Phe lie Gin Glu Asp Tyr Lys 
115 120 125 



Val Thr Val Ser Ala Glu Leu Glu Gly Leu Asp Glu Ser Ala Ala Gin 
130 135 140 



Val Met Ala Thr Tyr Val Trp Asn Leu Gly Tyr Asp Gly Ser Gly lie 
145 150 155 160 



Thr lie Gly lie lie Asp Thr Gly lie Asp Ala Ser His Pro Asp Leu 
165 " 170 175 



Gin Gly Lys Val lie Gly Trp Val Asp Phe Val Asn Gly Arg Ser Tyr 
180 185 190 



Pro Tyr Asp Asp His Gly His Gly. Thr His Val Ala Ser lie Ala Ala 
195 200 205 



Gly Thr Gly Ala Ala Ser Asn Gly Lys Tyr Lys Gly Met Ala Pro Gly 
210 215 220 



Ala Lys Leu Ala Gly lie Lys Val Leu Gly Ala Asp Gly Ser Gly Ser 
225 230 235 240 



lie Ser Thr lie lie Lys Gly Val Glu Trp Ala Val Asp Asn Lys Asp 
245 250 255 



Lys Tyr Gly lie Lys Val lie Asn Leu Ser Leu Gly Ser Ser Gin Ser 
260 265 270 



Ser Asp Gly Thr Asp Ser Leu Ser Gin Ala Val Asn Asn Ala Trp Asp 
275" 2^0 2H5 



Ala Gly He Val Val Cys Val Ala Ala Gly Asn Ser Gly Pro Asn Thr 
290 295 300 



Tyr Thr Val Gly Ser Pro Ala Ala Ala Ser Lys Val He Thr Val Gly 
305 310 315 320 



Ala Val Asp Ser Asn Asp Asn He Ala Ser Phe Ser Ser Arg Gly Pro 
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325 330 335 



Thr Ala Asp Gly Arg Leu Lys Pro Glu Val Val Ala Pro Gly Val Asp 
340 345 350 



lie lie Ala Pro Arg Ala Ser Gly Thr Ser Met Gly Thr Pro lie Asn 
355 ~ 360 ~ 365 



Asp Tyr Tyr Thr Lys Ala Ser Gly Thr Ser Met Ala Thr Pro His Val 
370 375 380 



Ser Gly Val Gly Ala Leu lie Leu Gin Ala His Pro Ser Trp Thr Pro 
385 390 395 400 



Asp Lys Val Lys Thr Ala Leu lie Glu Thr Ala Asp lie Val Ala Pro 
405 410 415 



Lys Glu lie Ala Asp lie Ala Tyr Gly Ala Gly Arg Val Asn Val Tyr 
420 425 430 



Lys Ala lie Lys Tyr Asp Asp Tyr Ala Lys Leu Thr Phe Thr Gly Ser 
435 440 445 



Val Ala Asp Lys Gly Ser Ala Thr His Thr Phe Asp Val Ser Gly Ala 
450 455 460 



Thr Phe Val Thr Ala Thr Leu Tyr Trp Asp Thr Gly Ser Ser Asp lie 
465 470 475 480 



Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Glu Val Asp Tyr Ser Tyr 
485 490 " 495 



Thr Ala Tyr Tyr Gly Phe Glu Lys Val Gly Tyr Tyr Asn Pro Thr Ala 
500 505 510 



Gly Thr Trp Thr Val Lys Val Val Ser Tyr Lys Gly Ala Ala Asn Tyr 
515 520 ~ 525 



Gin Val Asp Val Val Ser Asp Gly Ser Leu Ser Gin Ser Gly Gly Gly 
530 535 * 540 



Asn Pro Asn Pro Asn Pro Asn Pro Asn Pro Thr Pro Thr Thr Asp Thr 
545 550 555 560 
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Gin Thr Phe Thr Gly Ser Val Asn Asp Tyr Trp Asp Thr Ser Asp Thr 
565 570 575 



Phe Thr Met Asn Val Asn Ser Gly Ala Thr Lys lie Thr Gly Asp Leu 
580 585 590 



Thr Phe Asp Thr Ser Tyr Asn Asp Leu Asp Leu Tyr Leu Tyr Asp Pro 
595 600 605 



Asn Gly Asn Leu Val Asp Arg Ser Thr Ser Ser Asn Ser Tyr Glu His 
610 615 620 



Val Glu Tyr Ala Asn Pro Ala Pro Gly Thr Trp Thr Phe Leu Val Tyr 
625 630 635 640 



Ala Tyr Ser Thr Tyr Gly Trp Ala Asp Tyr Gin Leu Lys Ala Val Val 
645 650 655 



Tyr Tyr Gly 



<210> 6 
<211> 1977 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 6 

atgaaggggc tgaaagctct catattagtg attttagttc taggtttggt agtagggagc 60 
gtagcggcag ctccagagaa gaaagttgaa caagtaagaa atgttgagaa gaactatggt .120 
ctgctaacgc caggactgtt cagaaaaatt caaaaattga atcctaacga ggaaatcagc 180 
acagtaattg tatttgaaaa ccatagggaa aaagaaattg cagtaagagt tcttgagtta 240 
atgggtgcaa aagttaggta tgtgtaccat attatacccg caatagctgc cgatcttaag 3TJTT 



gttagagact tactagtcat ctcaggttta acagggggta aagctaagct ttcaggtgtt 360 

aggtttatcc aggaagacta caaagttaca gtttcagcag aattagaagg actggatgag 420 

tctgcagctc aagttatggc aacttacgtt tggaacttgg gatatgatgg ttctggaatc 480 

acaataggaa taattgacac tggaattgac gcttctcatc cagatctcca aggaaaagta 540 

attgggtggg tagattttgt caatggtagg agttatccat acgatgacca tggacatgga 600 
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actcatgtag 


cttcaatagc 


agctggtact 


ggagcagcaa 


gtaatggcaa 


gtacaaggga 


660 


atggctccag 


gagctaagct 


ggcgggaatt 


aaggttctag 


gtgccgatgg 


ttctggaagc 


720 


atatctacta 


taattaaggg 


agttgagtgg 


gccgttgata 


acaaagataa 


gtacggaatt 


780 


aaggtcatta 


atctttctct 


tggttcaagc 


cagagctccg 


acggaaccga 


ctccctcagt 


840 


caggccgtca 


acaacgcctg 


ggacgccggt 


atagtagtct 


gcgtcgccgc 


cggcaacagc 


900 


gggccgaaca 


cctacaccgt 


cggctcaccc 


gccgccgcga 


gcaaggtcat 


aaccgtcggt " 


960 


gcagttgaca 


gcaacgacaa 


catcgccagc 


ttctccagca 


ggggaccgac 


cgcggacgga 


1020 


aggctcaagc 


cggaagtcgt 


cgcccccggc 


gttgacatca 


tagccccgcg 


cgccagcgga 


1080 


accagcatgg 


gcaccccgat 


aaacgactac 


tacaccaagg 


cctctggaac 


cagcatggcc 


1140 


accccgcacg 


tttcgggcgt 


tggcgcgctc 


atcctccagg 


cccacccgag 


ctggaccccg 


1200 


gacaaggtga 


agaccgccct 


catcgagacc 


gccgacatag 


tcgcccccaa 


ggagatagcg 


1260 


gacatcgcct 


acggtgcggg 


tagggtgaac 


gtctacaagg 


ccatcaagta 


cgacgactac 


1320 


gccaagctca 


ccttcaccgg 


ctccgtcgcc 


gacaagggaa 


gcgccaccca 


caccttcgac 


1380 


gtcagcggcg 


ccaccttcgt 


gaccgccace 


ctctactggg 


acacgggctc 


gagcgacatc 


1440 


gacctctacc 


tctacgaccc 


caacgggaac 


gaggttgact 


actcctacac 


cgcctactac 


1500 


ggcttcgaga 


aggtcggcta 


ctacaacccg 


accgccggaa 


cctggacggt 


caaggtcgtc 


1560 


agctacaagg 


gcgcggcgaa 


ctaccaggtc 


gacgtcgtca 


gcgacgggag 


cctcagccag 


1620 


tccggcggcg 


gcaacccgaa 


tccaaacccc 


aacccgaacc 


caaccccgac 


caccgacacc 


1680 


cagaccttca 


ccggttccgt 


taacgactac 


tgggacacca 


gcgacacctt 


caccatgaac 


1740 


gtcaacagcg 


gtgccaccaa 


gataaccggt 


gacctgacct 


tcgatacttc 


ctacaacgac 


1800 


ctcgacctct 


acctctacga 


ccccaacggc 


aacctcgttg 


acaggtccac 


gtcgagcaac 


1860 


agctacgagc 


acgtcgagta 


cgccaacccc 


gccccgggaa 


cctggacgtt 


cctcgtctac 


1920 


gcctacagca 


cctacggctg 


ggcggactac 


cagctcaagg 


ccgtcgtcta 


ctacggg 


1977 



<210> 7 
<211> 4765 
<212> DNA 

<213> Pyrococcus furiosus 
<400> 7 

tttaaattat aagatataat cactccgagt gatgagtaag atacatcatt acagtcccaa 60 
aatgtttata attggaacgc agtgaatata caaaatgaat ataacctcgg aggtgactgt 120 
agaatgaata agaagggact tactgtgcta tttatagcga taatgctcct ttcagtagtt 180 
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• 






•> 






ccagtgcact 


ttgtgtccgc 


agaaacacca 


ccggttagtt 


cagaaaattc 


aacaacttct 


:240 


atactcccta 


accaacaagt 


tgtgacaaaa 


gaagtttcac 


aagcggcgct 


taatgctata 


300 


atgaaaggac 


aacccaacat 


ggttcttata 


atcaagacta 


aggaaggcaa 


acttgaagag 


360 


gcaaaaaccg 


agcttgaaaa 


gctaggtgca 


gagattcttg 


acgaaaatag 


agttcttaac 


420 


atgttgctag 


ttaagattaa 


gcctgagaaa 


gttaaagagc 


tcaactatat 


ctcatctctt 


480 


gaaaaagcct 


ggcttaacag 


agaagttaag 


ctttcccctc 


caattgtcga 


aaaggacgtc 


540 


aagactaagg 


agccctccct 


agaaccaaaa 


atgtataaca 


gcacctgggt 


aattaatgct 


600 


ctccagttca 


tccaggaatt 


tggatatgat 


ggtagtggtg 


ttgttgttgc 


agtacttgac 


. 660 


acgggagttg 


atccgaacca 


tcctttcttg 


agcataactc 


cagatggacg 


caggaaaatt 


720 


atagaatgga 


aggattttac 


agacgaggga 


ttcgtggata 


catcattcag 


ctttagcaag 


780 


gttgtaaatg 


ggactcttat 


aattaacaca 


acattccaag 


tggcctcagg 


tctcacgctg 


840 


aatgaatcga 


caggacttat 


ggaatacgtt 


gttaagactg 


tttacgtgag 


caatgtgacc 


900 


attggaaata 


tcacttctgc 


taatggcatc 


tatcacttcg 


gcctgctccc 


agaaagatac 


960 


ttcgacttaa 


acttcgatgg 


tgatcaagag 


gacttctatc 


ctgtcttatt 


agttaactcc . 


1020 


actggcaatg 


gttatgacat 


tgcatatgtg 


gatactgacc 


ttgactacga 


cttcaccgac 


1080 


gaagttccac 


ttggccagta 


caacgttact 


tatgatgttg 


ctgtttttag 


ctactactac 


1140 


ggtcctctca 


actacgtgct 


tgcagaaata 


gatcctaacg 


gagaatatgc 


agtatttggg 


1200 


tgggatggtc 


acggtcacgg 


aactcacgta 


gctggaactg 


ttgctggtta 


cgacagcaac 


1260 


aatgatgctt 


gggattggct 


cagtatgtac 


tctggtgaat 


gggaagtgtt 


ctcaagactc 


1320 


tatggttggg 


attatacgaa 


cgttaccaca 


gacaccgtgc 


agggtgttgc 


tccaggtgcc 


1380 


caaataatgg 


caataagagt 


tcttaggagt 


gatggacggg 


gtagcatgtg 


ggatattata 


1440 


gaaggtatga 


catacgcagc 


aacccatggt 


gcagacgtta 


taagcatgag 


tctcggtgga 


1500 


aatgctccat 


acttagatgg 


tactgatcca 


gaaagcgttg 


ctgtggatga 


gcttaccgaa 


1560 


aagtacggtg 


ttgtattcgt 


aatagctgca 


ggaaatgaag 


gtcctggcat 


taacatcgtt 


1620 


ggaagtcctg 


gtgttgcaac 


aaaggcaata 


actgttggag 


ctgctgcagt 


gcccattaac 


1680 


gttggagttt 


atgtttccca 


agcacttgga 


tatcctgatt 


actatggatt 


ctattacttc 


1740 


cccgcctaca 


eaaacgttag 


aatagcattc 


ttctcaagca 


gagggccgag 


aatagatggt 


1800 


gaaataaaac 


ccaatgtagt 


ggctccaggt 


tacggaattt 


actcatccct 


gccgatgtgg 


1860 


attggcggag 


ctgacttcat 


gtctggaact 


tcgatggcta 


ctccacatgt 


cagcggtgtc 


1920 
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• 






•> 






gttgcactcc 


tcataagcgg 


ggcaaaggcc 


gagggaatat 


actacaatcc 


agatataatt 


1980 


aagaaggttc 


ttgagagcgg 


tgcaacctgg 


cttgagggag 


atccatatac 


tgggcagaag 


2040 


tacactgagc 


ttgaccaagg 


tcatggtctt 


gttaacgtta 


ccaagtcctg 


ggaaatcctt 


2100 


aaggctataa* 


acggcaccac 


tctcccaatt 


gttgatcact 


gggcagacaa 


gtcctacagc 


2160 


gactttgcgg 


agtacttggg 


tgtggacgtt 


ataagaggtc 


tctacgcaag 


gaactctata 


2220 


cctgacattg 


tcgagtggca 


cattaagtac 


gtaggggaca 


cggagtacag 


aacttttgag 


2280 


atctatgcaa 


ctgagccatg 


gattaagcct 


tttgtcagtg 


gaagtgtaat 


tctagagaac 


2340 


aataccgagt 


ttgtccttag 


ggtgaaatat 


gatgtagagg 


gtcttgagcc 


aggtctctat 


2400 


gttggaagga 


taatcattga 


tgatccaaca 


acgccagtta 


ttgaagacga 


gatcttgaac 


2460 


acaattgtta 


ttcccgagaa 


gttcactcct 


gagaacaatt 


acaccctcac 


ctggtatgat 


2520 


attaatggtc 


cagaaatggt 


gactcaccac 


ttcttcactg 


tgcctgaggg 


agtggacgtt 


2580 


ctctacgcga 


tgaccacata 


ctgggactac 


ggtctgtaca 


gaccagatgg 


aatgtttgtg 


2640 


ttcccatacc 


agctagatta 


tcttcccgct 


gcagtctcaa 


atccaatgcc 


tggaaactgg 


2700 


gagctagtat 


ggactggatt 


taactttgca 


cccctctatg 


agtcgggctt 


ccttgtaagg 


2760 


atttacggag 


tagagataac 


tccaagcgtt 


tggtacatta 


acaggacata 


ccttgacact 


2820 


aacactgaat 


tctcaattga 


attcaatatt 


actaacatct 


atgccccaat 


taatgcaact 


2880 


ctaatcccca 


ttggccttgg 


aacctacaat 


gcgagcgttg 


aaagcgttgg 


tgatggagag 


2940 


ttcttcataa 


agggcattga 


agttcctgaa 


ggcaccgcag 


agttgaagat 


taggataggc 


3000 


aacccaagtg 


ttccgaattc 


agatctagac 


ttgtaccttt 


atgacagtaa 


aggcaattta 


3060 


gtggccttag 


atggaaaccc 


aacagcagaa 


gaagaggttg 


tagttgagta 


tcctaagcct 


3120 


ggagtttatt 


caatagtagt 


acatggttac 


agcgtcaggg 


acgaaaatgg 


taatccaacg 


3180 


acaaccacct 


ttgacttagt 


tgttcaaatg 


acccttgata 


atggaaacat 


aaagcttgac 


32.4 0 


aaagactcga 


ttattcttgg 


aagcaatgaa 


agcgtagttg 


taactgcaaa 


cataacaatt 


3300 


ga.tagagatc 


atcctacagg 


agtatactct 


ggtatcatag 


agattagaga 


taatgaggtc 


3360 


taccaggata 


caaatacttc 


aattgcgaaa 


atacccataa 


ctttggtaat 


tgacaaggcg 


3420 


gactttgccg 


ttggtctcac 


accagcagag 


ggagtacttg 


gagaggctag 


aaattacact 


3480 


ctaattgtaa 


agcatgccct 


aacactagag 


cctgtgccaa 


atgctacagt 


gattatagga 


3540 


aactacacct 


acctcacaga 


cgaaaacggt 


acagtgacat 


tcacgtatgc 


tccaactaag 


3600 


ttaggcagtg 


atgaaatcac 


agtcatagtt 


aagaaagaga 


acttcaacac 


attagagaag 


3660 
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• 






• 






accttccaaa 


tcacagtatc 


agagcctgaa 


ataactgaag 


aggacataaa 


tgagcccaag 


3720 


cttgcaatgt 


catcaccaga 


agcaaatgct 


accatagtat 


cagttgagat 


ggagagtgag 


3780 


ggtggcgtta 


aaaagacagt 


gacagtggaa 


ataactataa 


acggaaccgc 


taatgagact 


3840 


gcaacaatag 


tggttcctgt 


tcctaagaag 


gccgaaaaca 


tcgaggtaag 


tggagaccac. 


3900 


gtaatttcct 


atagtataga 


ggaaggagag 


tacgccaagt 


acgttataat 


tacagtgaag 


3960 


tttgcatcac 


ctgtaacagt 


aactgttact 


tacactatct 


atgctggccc 


aagagtctca 


4020 


atcttgacac 


ttaacttcct 


tggctactca 


tggtacagac 


tatattcaca 


gaagtttgac 


4080 


gaattgtacc 


aaaaggccct 


tgaattggga 


gtggacaacg 


agacattagc 


tttagccctc 


4140 


agctaccatg 


aaaaagccaa 


agagtactac 


gaaaaggccc 


ttgagcttag 


cgagggtaac 


4200 


ataatccaat 


accttggaga 


cataagacta 


ttacctccat 


taagacaggc 


atacatcaat 


4260 


gaaatgaagg 


cagttaagat 


actggaaaag 


gccatagaag 


aattagaggg 


tgaagagtaa 


4320 


tctccaattt 


ttcccacttt 


ttcttttata 


acattccaag 


ccttttctta 


gcttcttcgc 


4380 


tcattctatc 


aggagtccat 


ggaggatcaa 


aggtaagttc 


aacctccaca 


tctcttactc 


4440 


ctgggatttc 


gagtactttc 


tcctctacag 


ctctaagaag 


ccagagagtt 


aaaggacacc 


4500 


caggagttgt 


cattgtcatc 


tttatatata 


ccgttttgtc 


aggattaatc 


tttagctcat 


4560 


aaattaatcc 


aaggtttaca 


acatccatcc 


caatttctgg 


gtcgataacc 


tcctttagct 


4620 


tttccagaat 


catttcttca 


gtaatttcaa 


ggttctcatc 


tttggtttct 


ctcacaaacc 


4680 


caatttcaac 


dtgcctgata 


ccttctaact 


ccctaagctt 


gttatatatc 


tccaaaagag 


4740 


tggcatcatc 


aattttctct 


ttaaa 








4765 



<210> 8 

<211> 1398 

<212> PRT 

<213> Pyrococcus furiosus 

<400> 8 



Met Asn Lys Lys Gly Leu Thr Val Leu Phe lie Ala lie Met Leu Leu 
15 10 15 



Ser Val Val Pro Val His Phe Val Ser Ala Glu Thr Pro Pro Val Ser 

20 25 30 

Ser Glu Asn Ser Thr Thr Ser lie Leu Pro Asn Gin Gin Val Val Thr 
35 . 40 45 
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Lys Glu Val Ser Gin Ala Ala Leu Asn Ala lie Met Lys Gly Gin Pro 
50 55 60 



Asn Met Val Leu lie lie Lys Thr Lys Glu Gly Lys Leu Glu Glu Ala 
65 70 75 80 



Lys Thr. Glu Leu Glu Lys Leu Gly Ala Glu lie Leu Asp Glu Asn Arg 
85 90 95 



Val Leu Asn Met Leu Leu Val Lys lie Lys Pro Glu Lys Val Lys Glu 
100 105 " 110 



Leu Asn Tyr lie Ser Ser Leu Glu Lys Ala Trp Leu Asn Arg Glu Val 
115 120 125 



Lys Leu Ser Pro Pro lie Val Glu Lys Asp Val Lys Thr Lys Glu Pro 
130 135 " ~ 140 



Ser Leu Glu Pro Lys Met Tyr Asn Ser Thr Trp Val lie Asn Ala Leu 
145 150 155 160 



Gin Phe lie Gin Glu Phe Gly Tyr Asp Gly Ser Gly Val Val Val Ala 
165 170 175 



Val Leu Asp Thr Gly Val Asp Pro Asn His Pro Phe Leu Ser lie Thr 
180 185 190 



Pro Asp Gly Arg Arg Lys lie lie Glu Trp Lys Asp Phe Thr Asp Glu 
195 200 205 



Gly Phe Val Asp Thr Ser Phe Ser Phe Ser Lys Val Val Asn Gly Thr 
210 215 220 



Leu lie lie Asn Thr Thr Phe Gin Val Ala Ser Gly Leu Thr Leu Asn 
"2^5 230 235 2tT 



Glu Ser Thr Gly Leu Met Glu Tyr Val Val Lys Thr Val Tyr Val Ser 
245 250 255 



Asn Val Thr lie Gly Asn lie Thr Ser Ala Asn Gly lie Tyr His Phe 
260 265 270 



Gly Leu Leu Pro Glu Arg Tyr Phe Asp Leu Asn Phe Asp Gly Asp Gin 
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275 280 285 



Glu Asp Phe Tyr Pro Val Leu Leu Val Asn Ser Thr Gly Asn Gly Tyr 
290 295 300 



Asp He Ala Tyr Val Asp Thr Asp Leu Asp Tyr Asp Phe Thr Asp Glu 
305 310 315 320 



Val Pro Leu Gly Gin Tyr Asn Val Thr Tyr Asp Val Ala Val Phe Ser 
325 330 335 



Tyr Tyr Tyr Gly Pro Leu Asn Tyr Val Leu Ala Glu He Asp Pro Asn 
340 345 350 



Gly Glu Tyr Ala Val Phe Gly Trp Asp Gly His Gly His Gly Thr His 
355 360 365 



Val Ala Gly Thr. Val Ala Gly Tyr Asp Ser Asn Asn Asp Ala Trp Asp 
370 375 380 



Trp Leu Ser Met Tyr Ser Gly Glu Trp Glu Val Phe Ser Arg Leu Tyr 
385 390 395 400 



Gly Trp Asp Tyr Thr Asn Val Thr Thr Asp Thr Val Gin Gly Val Ala 
405 410 "* 415 



Pro Gly Ala Gin He Met Ala He Arg Val Leu Arg Ser Asp Gly Arg 
420 425 ' 430 



Gly Ser Met Trp Asp He He Glu Gly Met Thr Tyr Ala Ala Thr His 
435 440 445 



Gly Ala Asp Val He Ser Met Ser Leu Gly Gly Asn Ala Pro Tyr Leu 
450 455 4 60 



Asp Gly Thr Asp Pro Glu Ser Val Ala Val Asp Glu Leu Thr Glu Lys 
465 470 475 480 



Tyr Gly Val Val Phe Val He Ala Ala Gly Asn Glu Gly Pro Gly He 
485 490 495 



Asn He Val Gly Ser Pro Gly Val Ala Thr Lys Ala He Thr Val Gly 
500 505 510 
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• 



Ala Ala Ala Val Pro lie Asn Val Gly Val Tyr Val Ser Gin Ala Leu 
515 520 525 



Gly Tyr Pro Asp Tyr Tyr Gly Phe Tyr Tyr Phe Pro Ala Tyr Thr Asn 
530 535 540 



Val Arg lie Ala Phe Phe Ser Ser Arg Gly Pro Arg lie Asp Gly Glu 
545 550 555 560 



lie Lys Pro Asn Val Val Ala Pro Gly Tyr Gly lie Tyr Ser Ser Leu 
565 570 575 



Pro Met Trp lie Gly Gly Ala Asp Phe Met Ser Gly Thr Ser Met Ala 
580 585 590 



Thr Pro His Val Ser Gly Val Val Ala Leu Leu lie Ser Gly Ala Lys 
595 600 605 



Ala Glu Gly lie Tyr Tyr Asn Pro Asp lie lie Lys Lys Val Leu Glu 
610 615 620 



Ser Gly Ala Thr Trp Leu Glu Gly Asp Pro Tyr Thr Gly Gin Lys Tyr 
625 630 635 640 



Thr Glu Leu Asp Gin Gly His Gly Leu Val Asn Val Thr Lys Ser Trp 
645 650 655 



Glu lie Leu Lys Ala lie Asn Gly Thr Thr Leu Pro lie Val Asp His 
660 665 670 



Trp Ala Asp Lys Ser Tyr Ser Asp Phe Ala Glu Tyr Leu Gly Val Asp 
675 680 685 



Val lie Arg Gly Leu Tyr Ala Arg Asn Ser lie Pro Asp lie Val Glu 
6^0 6¥5 7m 



Trp His lie Lys Tyr Val Gly Asp Thr Glu Tyr Arg Thr Phe Glu lie 
705 710 715 720 



Tyr Ala Thr Glu Pro Trp lie Lys Pro Phe Val Ser Gly Ser Val lie 
725 730 735 



Leu Glu Asn Asn Thr Glu Phe Val Leu Arg Val Lys Tyr Asp Val Glu 
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740 745 750 



Gly Leu Glu Pro Gly Leu Tyr Val Gly Arg lie lie lie Asp Asp Pro 
755 760 765 



Thr Thr Pro Val lie Glu Asp Glu lie Leu Asn Thr lie Val lie ■ Pro 
770 775 780 



Glu Lys Phe Thr Pro Glu Asn Asn Tyr Thr Leu Thr Trp Tyr Asp He 
785 790 795 " 800 



Asn Gly Pro Glu Met Val Thr His His Phe Phe Thr Val Pro Glu Gly 
805 810 815 



Val Asp Val Leu Tyr Ala Met Thr Thr Tyr Trp Asp Tyr Gly Leu Tyr 
820 825 J 830 



Arg Pro Asp Gly Met Phe Val Phe Pro Tyr Gin Leu Asp Tyr Leu Pro 
835 840 845 



Ala Ala Val Ser Asn Pro Met Pro Gly Asn Trp Glu Leu Val Trp Thr 
850 855 860 



Gly Phe Asn Phe Ala Pro Leu Tyr Glu Ser Gly Phe Leu Val Arg He 
865 870 875 880 



Tyr Gly Val Glu He Thr Pro Ser Val Trp Tyr He Asn Arg Thr Tyr 
885 890 895 



Leu Asp Thr Asn Thr Glu Phe Ser He Glu Phe Asn He Thr Asn He 
900 905 910 



Tyr Ala Pro He Asn Ala Thr Leu He Pro He Gly Leu Gly Thr Tyr 
915 920 925 



Asn Ala Ser Val Glu . Ser Val Gly Asp Gly Glu Phe Phe He Lys Gly 
930 935 940 



He Glu Val Pro Glu Gly Thr Ala Glu Leu Lys lie Arg He Gly Asn 
945 950 955 960 



Pro Ser Val Pro Asn Ser Asp Leu Asp Leu Tyr Leu Tyr Asp Ser Lys 
965 970 ~ ~ 975 
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Gly Asn Leu Val Ala Leu Asp Gly Asn Pro Thr Ala Glu Glu Glu Val 
980 985 990 



Val Val Glu Tyr Pro Lys Pro Gly Val Tyr Ser He Val Val His Gly 
995 1000 1005 - 



Tyr Ser Val Arg Asp Glu Asn Gly Asn Pro Thr Thr Thr Thr Phe 
1010 1015 1020 



Asp Leu Val Val Gin Met Thr Leu Asp Asn Gly Asn He Lys Leu 
1025 1030 1035 



Asp Lys Asp Ser He He Leu Gly Ser Asn Glu Ser Val Val Val 
1040 1045 1050 



Thr Ala Asn He Thr lie Asp Arg Asp His Pro Thr Gly Val Tyr 
1055 1060 1065 



Ser Gly He He Glu He Arg Asp Asn Glu Val Tyr Gin Asp Thr 
1070 1075 1080 



Asn Thr Ser He Ala Lys He Pro He Thr Leu Val He Asp Lys 
1085 " 1090 1095 



Ala Asp Phe Ala Val Gly Leu Thr Pro Ala Glu Gly Val Leu Gly 
1100 1105 1110 



Glu Ala Arg Asn Tyr Thr Leu He Val Lys His Ala Leu Thr Leu 
1115 1120 1125 



Glu Pro Val Pro Asn Ala Thr Val He He Gly Asn Tyr Thr Tyr 
1130 1135 ~ 1140 



Leu Thr Asp Glu Asn Gly Thr Val Thr Phe Thr Tyr Ala Pro Thr 
ITT5 TT5D H53 



Lys Leu Gly Ser Asp Glu He Thr Val He Val Lys Lys Glu Asn 
1160 1165 1170 



Phe Asn Thr Leu Glu Lys Thr Phe Gin He Thr Val Ser Glu Pro 
1175 1180 1185 



Glu He Thr Glu Glu Asp He Asn Glu Pro Lys Leu Ala Met Ser 
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1190 1195 1200 



Ser Pro Glu Ala Asn Ala Thr lie Val Ser Val Glu Met Glu Ser 
1205 1210 1215 



Glu Gly Gly Val Lys Lys Thr Val Thr Val Glu lie Thr lie Asn 
1220 1225 1230 



Gly Thr Ala Asn Glu Thr Ala Thr He Val Val Pro Val Pro Lys 
1235 1240 1245 



Lys Ala Glu Asn He Glu Val Ser Gly Asp His Val He Ser Tyr 
1250 1255 1260 



Ser He Glu Glu Gly Glu Tyr Ala Lys Tyr Val He He Thr Val 
1265 1270 1275 



Lys Phe Ala Ser Pro Val Thr Val Thr Val Thr Tyr Thr He Tyr 
1280 1285 1290 



Ala Gly Pro Arg Val Ser He Leu Thr Leu Asn Phe Leu Gly Tyr 
1295 1300 1305 



Ser Trp Tyr Arg Leu Tyr Ser Gin Lys Phe Asp Glu Leu Tyr Gin 
1310 1315 1320 



Lys Ala Leu Glu Leu Gly Val Asp Asn Glu Thr Leu Ala Leu Ala 
1325 1330 1335 



Leu Ser Tyr His Glu Lys Ala Lys Glu Tyr Tyr Glu Lys Ala Leu 
1340 1345 1350 



Glu Leu Ser Glu Gly Asn He He Gin Tyr Leu Gly Asp He Arg 
1355 1360 1365 



Leu Leu Pro Pro Leu Arg Gin Ala Tyr He Asn Glu Met Lys Ala 
1370 1375 1380 



Val Lys He Leu Glu Lys Ala He Glu Glu Leu Glu Gly Glu Glu 
1385 1390 1395 



<210> 9 
<211> 35 
<212> DNA 
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<213> 



Artificial Sequence 



<220> 
<223> 



Synthetic 



<400> 9 

ggwwsdrrtg ttrrhgthgc dgtdmtygac acbgg 



35 



<210> 10 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic . 

<400> 10 

kstcacggaa ctcacgtdgc bggmacdgtt gc 32 

<210> 11 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<210> 12 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 12 

chccgsyvac rtgbggagwd gccatbgavg tdcc 34 

<210> 13 

<211> TT5 

<212> DNA 

<213> Thermococcus celer 
<220> 

<221> misc__f eature 

<222> (127) . . (127) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (130) . . (130) 



<400> 11 

ascmgcaach gtkccvgcha cgtgagttcc gtg 



33 
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<223> n is a, c, g, or t 
<220> 

<221> misc_f eature 
<222> (136) . . (136) 

<223> n is a, c, g, or t - 
<400> 13 

a gtt gcg gta att gac acg ggt ata gac gcg aac cac ccc gat ctg aag 4 9 

Val Ala Val lie Asp Thr Gly lie Asp Ala Asn His Pro Asp Leu Lys 
1 5 10 15 

ggc aag gtc ata ggc tgg tac gac gcc gtc aac ggc agg teg acc ccc 97 
Gly Lys Val lie Gly Trp Tyr Asp Ala Val Asn Gly Arg Ser Thr Pro 
20 25 30 

tac gat gac cag gga cac gga act cac gtn gen gga acn gtt get ggt 145 
Tyr Asp Asp Gin Gly His Gly Thr His Val Ala Gly Thr Val Ala Gly 
35 40 45 



<210> 14 
<211> 564 
<212> DNA 

<213> Thermococcus celer 
<400> 14 

tct cac gga act cac gtg gcg gga aca gtt gcc gga aca ggc age gtt 4 8 

Ser His Gly Thr His Val Ala Gly Thr Val Ala Gly Thr Gly Ser Val 
1 5 10 15 

aac tec cag tac ata ggc gtc gcc ccc ggc gcg aag etc gtc ggt gtc 96 
Asn Ser Gin Tyr lie Gly Val Ala Pro Gly Ala Lys Leu Val Gly Val 
20 25 30 

aag gtt etc ggt gcc gac ggt teg gga age gtc tec acc ate ate gcg 14 4 

Lys Val Leu Gly Ala Asp Gly Ser Gly Ser Val Ser Thr He He Ala 
35 .40 45 

ggt gtt gac tgg gtc gtc cag aac aag gat aag tac ggg ata agg gtc 192 
Gly Val Asp Trp Val Val Gin Asn Lys Asp Lys Tyr Gly He Arg Val 
50 55 ^60 

ate aac etc tec etc ggc tec tec cag age tec gac gga gcc gac tec 240 
He Asn Leu Ser Leu Gly Ser Ser Gin Ser Ser Asp Gly Ala Asp Ser 
~63 TO T5 8*0 



etc agt cag gcc gtc aac aac gcc tgg gac gcc ggt ata gta gtc tgc 288 

Leu Ser Gin Ala Val Asn Asn Ala Trp Asp Ala Gly He Val Val Cys 

85 90 95 

gtc gcc. gcc ggc aac age ggg ccg aac acc tac acc gtc ggc tea ccc 336 

Val Ala Ala Gly Asn Ser Gly Pro Asn Thr Tyr Thr Val Gly Ser Pro 

100 105 110 

gcc gcc gcg age aag gtc ata acc gtc ggt gca gtt gac age aac gac 384 

Ala Ala Ala Ser Lys Val He Thr Val Gly Ala Val Asp Ser Asn Asp 
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115 120 125 

aac ate gec age ttc tec age agg gga ccg ace gcg gac gga agg etc 432 
Asn lie Ala Ser Phe Ser Ser Arg Gly Pro Thr Ala Asp Gly Arg Leu 
130 135 140 

aag ccg gaa gtc gtc gec ccc ggc gtt gac ate ata gee ccg cgc gec 480 
Lys Pro Glu Val Val Ala Pro Gly Val Asp lie lie Ala Pro Arg Ala 
145 150 155 160 

age gga ace age atg ggc ace ccg ata aac gac tac tac acc aag gec 528 
Ser Gly Thr Ser Met Gly Thr Pro lie Asn Asp Tyr Tyr Thr Lys Ala 
165 170 *" " " 175 

tct gga acc tea atg gec act ccc cat gtt acc ggt 564 
Ser Gly Thr Ser Met Ala Thr Pro His Val Thr Gly 
180 185 



<210> 15 

<211> 1859 

<212> DNA 

<213> Thermococcus celer 

<400> 15 



gagctccgac 


ggaaccgact 


ccctcagtca 


ggccgtcaac 


aacgcctggg 


aegceggtat 


60 


agtagtctgc 


gtcgccgccg 


geaacagegg 


gccgaacacc 


tacaccgtcg 


gctcacccgc 


120 


cgccgcgagc 


aaggtcataa 


ccgtcggtgc 


agttgacagc 


aacgacaaca 


tcgccagctt 


180 


ctccagcagg 


ggaccgaccg 


eggaeggaag 


gctcaagccg 


gaagtegteg 


cccccggcgt 


24 0 


tgacatcata 


gccccgcgcg 


ccagcggaac 


cagcatgggc 


accccgataa 


acgactacta 


300 


caccaaggcc 


tctggaacca 


gcatggccac 


cccgcacgtt 


tcgggcgttg 


gcgcgctcat 


360 


cctccaggcc 


cacccgagct 


ggaccccgga 


caaggtgaag 


accgccctca 


tcgagaccgc 


420 


cgacatagtc 


gcccccaagg 


agatagegga 


catcgcctac 


ggtgcgggta 


gggtgaacgt 


480 


ctacaaggcc 


atcaagtacg 


acgactacgc 


caagctcacc 


ttcaccggct 


ccgtcgccga 


540 


caagggaagc 


gccacccaca 


ccttcgacgt 


cagcggcgcc 


accttcgtga 


ccgccaccct 


600 


ctactgggac 


aegggctega 


gegacatega 


cctctacctc 


tacgacccca 


aegggaaega 


660 


ggttgactac tcctacaccg cctactacgg 


cttcgagaag 


gteggctact 


acaacccgac 


720 


cgccggaacc 


tggaeggtea 


aggtegtcag 


ctacaagggc 


geggegaact 


accaggtcga 


780 


cgtcgtcagc 


gaegggagee 


tcagccagtc 


cggcggcggc 


aacccgaatc 


caaaccccaa 


840 


cccgaaccca 


accccgacca 


ccgacaccca 


gaccttcacc 


ggttccgtta 


acgactactg 


900 


ggacaccagc 


gacaccttca 


ecatgaaegt 


caacageggt 


gccaccaaga 


taaccggtga 


960 


cctgaccttc 


gatacttcct 


acaacgacct 


cgacctctac 


ctctacgacc 


ccaacggcaa 


1020 



- 25 - 





• 






• 






cctcgttgac 


aggtccacgt 


cgagcaacag 


ctacgagcac 


gtcgagtacg 


ccaaccccgc 


1080 


cccgggaacc 


tggacgttcc 


tcgtctacgc 


ctacagcacc 


tacggctggg 


cggactacca 


1140 


gctcaaggcc 


gtcgtctact 


acgggtgaag 


gtttttaatc 


cccttttctt 


tccccttttg 


1200 


aggtggttgg 


gatgaagcgg 


gttcttgcgg 


cgatccttgt 


aatcatgctc 


atcggattat 


1260 


cattccctgc 


cggaagtgct 


aaaatcgagc 


cctacgttta 


cagccccacc 


gttccggata 


1320 


ccgccttcgc 


ggttctcacc 


ctgtacagga 


ccggggacta 


cgcccgggtt 


ctcgagggat 


.1380 


acgagtggct 


cctccagatg 


agaactccca 


tcgattcgtg 


gggggtttcc 


cgcggggaaa 


1440 


cgcacatggc 


caagtacacg 


gcaatggcga 


tgctggccct 


catgcgcggc 


gagaacgtgg 


1500 


cgagggggcg 


ttacagggat 


gttctcaacg 


acgccgcgta 


ctggttaata 


tacaaacaga 


1560 


acccggacgg 


ctcgtgggag 


gactacaccg 


gaacggcgct 


ggccgtcatc 


gcgctcgggg 


1620 


agttccttaa 


gggcgggtac 


atcaacgcga 


acctgaccgg 


cttcaaaaag 


caggttaaag 


1680 


aggccgtaaa 


ccgcggggaa 


ggctggctga 


tggatgcgga 


cccaaaaacg 


gacgcggata 


1740 


gaatattcgg 


ctacctcgcc 


ctcggtaaaa 


aggacgaact 


caaaaagatg 


aacccttccg 


1800 


gtgacctgaa 


ggcctaccgc 


gcctttgcac 


ttgcctacct 


cggggagagg 


gtcgagctc 


1859 



<210> 16 
<211> 20 
<212> DNA 

<213> Thermococcus celer 
<400> 16 

tgtagtagtc gtttatcggg 20 

<210> 17 
<211> 1464 
<212> DNA 

<213> Thermococcus celer 
<400> 17 

aagcttaaca tcgagcgctc cacctctaaa gtaggtgagt gtggatacga aggttagggc 60 
cgctatgacg accttcagga tcccaacggc ttcttttatg gggagcccgg cgaaggtgag 120 
aattgaaagg attaccatac tccctccgct catcatggag cctatgaatc cccctccaaa 180 
agagagaagt gctataagga gcgtcctcat gttccatgct atgttttggt atttaatgct 240 
tttccgctta atgttacacc tcctcatgac aatttcgcgt ttagggatgg ggttaattgg 300 
acccctccga gccacgggtt gatgtccatt atgtcgatat tcaccatctt atccccaact 360 
ttgtgggttt caaacattac cctacgttat atttttatcg tcctaattaa ctgctgaaac 420 
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# 






• 






gggcgcttat 


cgttcatcgt 


tgatggtttt 


gggtgaccgg 


gcattaagga 


attgtgtcgt 


480 


ttgctgaaat 


ttatgaaacg 


gagttggctt 


ctttatgtta 


cataaagatg 


tacattactg 


540 


taatgtatat 


aaatggaaga 


aacactgttg 


cgtaaacttt 


ttaatgtatc 


caatatcagt 


600 


acttcgatgt 


cccgatatgg 


gacatgttgg 


ataggagggt 


actggaatga 


agaggttagg 


. 660 


tgctgtggtg 


ctggcactgg 


tgctcgtggg 


tcttctggcc 


ggaacggccc 


ttgcggcacc 


720 


cgtaaaaccg 


gttgtcagga 


acaacgcggt 


tcagcagaag 


aactacggac 


tgctgacccc 


780 


gggactgttc 


aagaaagtcc 


agaggatgaa 


ctggaaccag 


gaagtggaca 


ccgtcataat 


840 


gttcgggagc 


tacggagaca 


gggacagggc 


ggttaaggta 


ctgaggctca 


tgggcgccca 


900 


ggtcaagtac 


tcctacaaga 


taatccctgc 


tgtcgcggtt 


aaaataaagg 


ccagggacct 


960 


tctgctgatc 


gcgggcatga 


tagacacggg 


ttacttcggt 


aacacaaggg 


tctcgggcat 


1020 


aaagttcata 


caggaggatt 


acaaggttca 


ggttgacgac 


gccacttccg 


tctcccagat 


1080 


aggggccgat 


accgtctgga 


actccctcgg 


ctacgacgga 


agcggtgtgg 


tggttgccat 


1140 


cgtcgatacg 


ggtatagacg 


cgaaccaccc 


cgatctgaag 


ggcaaggtca 


taggctggta 


1200 


cgactccgtc 


aacggcaggt 


cgacccccta 


cgatgaccag 


ggacacggaa 


cccacgttgc 


1260 


gggtatcgtt 


gccggaaccg 


ggagcgttaa 


ctcccagtac 


ataggcgtcg 


gccccggcgc 


1320 


gaagctcgtc 


ggcgtcaagg 


ttctcggttc 


cgacggttcg 


ggaagcgtct 


ccaccatcat 


1380 


cgcgggtgtt 


gactggaacg 


tccagaacta 


ggacaagtac 


gggataaggg 


tcatcaacct 


1440 


ctccctcggc 


tcctcccaga 


gctc 








1464 



<210> 18 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<4 0 0> TB : 

aaaagaattc ggatccatga agaggttagg tgc 33 

<210> 19 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
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<400> 19 

ttttatcgat caggcgtccc aggcgttg 



28 



<210> 20 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 20 

cattataggt aagagaggaa tg - 22 



<210> 21 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<210> 22 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 22 

tagcagtaat tgacacggg 19 



<210> 23 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<Z'Z6> byntnetic ' : 

<400> 23 

tagcagtaat tgacactgg 19 



<210> 24 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 



<400> 21 

gatccattcc tctcttacct ataatggtac 



30 



- 28 - 



<223> Synthetic 
<400> 24 

ctgttccagc tacgtgagtt cc 



22 



<210> 25 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 25 

ctgttccagc tacatgagtt cc 



22 



<210> 26 
<211> 507 
<212> DNA 

<213> Pyrococcus furiosus 
<400> 26 

a eta gtc ate tea ggt tta aca ggg ggt aaa get aag ctt tea ggt gtt 
Leu Val lie Ser Gly Leu Thr Gly Gly Lys Ala Lys Leu Ser Gly Val 
1 5 10 • 15 



49 



agg ttt ate cag gaa gac tac aaa gtt. aca gtt tea gca gaa tta gaa 
Arg Phe lie Gin Glu Asp Tyr Lys Val Thr Val Ser Ala Glu Leu Glu 
20 25 30 



97 



gga ctg gat gag tct gca get caa gtt atg gca act tac gtt tgg aac 
Gly Leu Asp Glu Ser Ala Ala Gin Val Met Ala Thr Tyr Val Trp Asn 
35 40 45 



145 



ttg gga tat gat ggt tct gga ate aca ata gga ata att gac act gga 
Leu Gly Tyr Asp Gly Ser Gly lie Thr lie Gly lie lie Asp Thr Gly 
50 55 60 



193 



att gac get tct cat cca gat etc caa gga aaa gta att ggg tgg gta 
lie Asp Ala Ser His Pro Asp Leu Gin Gly Lys Val lie Gly Trp Val 
65 * 70 75 80 



241 



gat ttt gtc aat ggt agg agt tat cca tac gat gac cat gga cat gga 
Asp Phe Val Asn Gly Arg Ser Tyr Pro Tyr Asp Asp His Gly His Gly 
83 9tJ 95 



289 



act cat gta get tea ata gca get ggt act gga gca gca agt aat ggc 
Thr His Val Ala Ser He Ala Ala Gly Thr Gly Ala Ala Ser Asn Gly 
100 105 110 



337 



aag tac aag gga atg get cca gga get aag ctg gcg gga att aag gtt 
Lys Tyr Lys Gly Met Ala Pro Gly Ala Lys Leu Ala Gly He Lys Val 
115 120 125 



385 



eta ggt gee gat ggt tct gga age ata tct act ata att aag gga gtt 
Leu Gly Ala Asp Gly Ser Gly Ser. He Ser. Thr He He Lys Gly Val 



433 
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130 



135 



140 



gag tgg gcc gtt gat aac aaa gat aag tac gga att aag gtc att aat 
Glu Trp Ala Val Asp Asn Lys Asp Lys Tyr Gly lie Lys Val lie Asn 
145 150 "* 155 160 



481 



ctt tct ctt ggt tea age cag age tc 
Leu Ser Leu Gly Ser Ser Gin Ser 
165 



507 



<210> 27 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 27 

tgacactgga attgacgett ctcatccaga 



30 



<210> 28 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 28 

tctccaagga aaagtaattg ggtgggtaga 



30 



<210> 29 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 29 

gttgccataa ettgagctge agactcatcc 



30 



<ZIU> 3D 

<211> 419 

<212> DNA 

<213> Pyrococcus furiosus 



<400> 30 

tttattaagc ataaaatagc catgeaaett tgatcactaa tgtgcggtgg tgcac atg 

Met 
1 



58 



aag ggg ■ ctg aaa get etc ata tta gtg att tta gtt eta ggt ttg gta 
Lys Gly Leu Lys Ala Leu lie Leu Val lie Leu Val Leu Gly Leu Val 



106 
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10 



15 



gta ggg age gta gcg gca get cca gag aag. aaa gtt gtt caa gta aga 
Val Gly Ser Val Ala Ala Ala Pro Glu Lys Lys Val Val Gin Val Arg 
20 25 30 



154 



aat gtt gag aag aac tat ggt ctg eta acg cca gga ctg ttc aga aaa 
Asn Val Glu Lys Asn Tyr Gly Leu Leu Thr Pro Gly Leu Phe Arg Lys 
35 40 45 



202 



att ccc aaa ttg gat cct aac gag gga ate age aca gta att gta ttt 
lie Pro Lys Leu Asp Pro Asn Glu Gly lie Ser Thr Val lie Val Phe 
50 55 60 65 



250 



gtt aac cat agg gga aaa gaa att gca gta aga gtt ctt gag tta atg 
Val Asn His Arg Gly Lys Glu lie Ala Val Arg Val Leu Glu Leu Met 
70 75 80 



298 



ggt gee caa gtt agg tat gtg tac cat att ata ccc cca ata get gee 
Gly Ala Gin Val Arg Tyr Val Tyr His lie lie Pro Pro lie Ala Ala 
85 90 95 



346 



gat ctt aag gtt aga gac tta eta gtc ate tea ggt tta aca ggg ggt 
Asp Leu Lys Val Arg Asp Leu Leu Val lie Ser Gly Leu Thr Gly Gly 
100 105 110 



394 



gaa act aag ctt tea ggt gtt agg t 
Glu Thr Lys Leu Ser Gly Val Arg 
115 " 120 



419 



<210> 31 
<211> 180 
<212> DNA 

<213> Pyrococcus furiosus 
<400> 31 

gctctagact ctgggaggag tagttatact tgatgaagee tattctgagt tttcgggaaa 
aagcttcata ccaaaaatca gtgagtatga aaatttagta attctaagga cgttttcaaa 
ggcgtttgga cttgctggaa ttagatgtgg atatatgata gcaaatgaaa agattataga 



60 
120 
180 



<210> 32 

<211> 28 

<2i2> um : 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 32 

agagggatcc atgaaggggc tgaaagct 



28 



<210> 33 
<211> 30 



- 31 - 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 33 

agaggcatgc gctctagact ctgggagagt 30 

<210> 34 

<211> 1962 

<212> DNA 

<213> Pyrococcus furiosus 

<400> 34 



atgaaggggc 


tgaaagctct 


catattagtg 


attttagttc 


taggtttggt 


agtagggagc 


60 


gtagcggcag 


ctccagagaa 


gaaagttgaa 


caagtaagaa 


atgttgagaa 


gaactatggt 


120 


ctgctaacgc 


caggactgtt 


cagaaaaatt 


caaaaattga 


atcctaacga 


ggaaatcagc 


180 


acagtaattg 


tatttgaaaa 


ccatagggaa 


aaagaaattg 


cagtaagagt 


tcttgagtta 


240 


atgggtgcaa 


aagttaggta 


tgtgtaccat 


attatacccg 


caatagctgc 


cgatcttaag 


300 


gttagagact 


tactagtcat 


ctcaggttta 


acagggggta 


aagctaagct 


ttcaggtgtt 


360 


aggtttatcc 


aggaagacta 


caaagttaca 


gtttcagcag 


aattagaagg 


actggatgag • 


420 


tctgcagctc 


aagttatggc 


aacttacgtt 


tggaacttgg 


gatatgatgg 


ttctggaatc 


480 


acaataggaa 


taattgacac 


tggaattgac 


gcttctcatc 


cagatctcca 


aggaaaagta 


540 


attgggtggg 


tagattttgt 


caatggtagg 


agttatccat 


acgatgacca 


tggacatgga 


600 


actcatgtag 


cttcaatagc 


agctggtact 


ggagcagcaa 


gtaatggcaa 


gtacaaggga 


660 


atggctccag 


gagctaagct 


ggcgggaatt 


aaggttctag 


gtgccgatgg 


ttctggaagc 


720 


atatctacta 


taattaaggg 


agttgagtgg 


gccgttgata 


acaaagataa 


gtacggaatt 


780 


aaggtcatta 


atctttctct 


tggttcaagc 


cagagctcag 


atggtactga 


cgctctaagt 


840 


caggctgtta 


atgcagcgtg 


ggatgctgga 


ttagttgttg 


tggttgccgc 


tggaaacagt 


900 


ggacctaaca 


agtatacaat 


cggttctcca 


gcagctgcaa 


gcaaagttat 


tacagttgga 


960 


gccgttgaca 


agtatgatgt 


tataacaagc 


ttctcaagca 


gagggccaac 


tgcagacggc 


1020 


aggcttaagc 


ctgaggttgt 


tgctccagga 


aactggataa 


ttgctgccag 


agcaagtgga 


1080 


actagcatgg 


gtcaaccaat 


taatgactat 


tacacagcag 


ctcctgggac 


atcaatggca 


1140 


actcctcacg 


tagctggtat 


tgcagccctc 


ttgctccaag 


cacacccgag 


ctggactcca 


1200 


gacaaagtaa 


aaacagccct 


catagaaact 


gctgatatcg 


taaagccaga 


tgaaatagcc 


1260 
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• 






• 






gatatagcct 


acggtgcagg 


tagggttaat 


gcatacaagg 


ctataaacta 


cgataactat 


1320 


gcaaagctag 


tgttcactgg 


atatgttgcc 


aacaaaggca 


gccaaactca 


ccagttcgtt 


1380 


attagcggag 


cttcgttcgt 


aactgccaca 


ttatactggg 


acaatgccaa 


tagcgacctt 


1440 


gatctttacc 


tctacgatcc 


caatggaaac 


caggttgact 


actcttacac 


cgcctactat 


1500 


ggattcgaaa 


aggttggtta 


ttacaaccca 


actgatggaa 


catggacaat 


taaggttgta 


1560 


agctacagcg 


gaagtgcaaa 


ctatcaagta 


gatgtggtaa 


gtgatggttc 


cctttcacag 


1620 


cctggaagtt 


caccatctcc 


acaaccagaa 


ccaacagtag 


acgcaaagac 


gttccaagga 


1680 


tccgatcact 


actactatga 


caggagcgac 


acctttacaa 


tgaccgttaa 


ctctggggct 


1740 


acaaagatta 


ctggagacct 


agtgtttgac 


acaagctacc. 


atgatcttga 


cctttacctc 


1800 


tacgatccta 


accagaagct 


tgtagataga 


tcggagagtc 


ccaacagcta 


cgaacacgta 


1860 


gaatacttaa 


cccccgcccc 


aggaacctgg 


tacttcctag 


tatatgccta 


ctacacttac 


1920 


ggttgggctt 


actacgagct 


gacggctaaa 


gtttattatg 


gc 




1962 



<210> 35 
<211> 654 
<212> PRT 

<213> Pyrococcus furiosus 
<400> 35 

Met Lys Gly Leu Lys Ala Leu lie Leu Val lie Leu Val Leu Gly Leu 
15 10 15 



Val Val Gly Ser Val Ala Ala Ala Pro Glu Lys Lys Val Glu Gin Val 
20 25 30 



Arg Asn Val Glu Lys Asn Tyr Gly Leu Leu Thr Pro Gly Leu Phe Arg 
35 40 45 



Lys lie Gin Lys Leu Asn Pro Asn Glu Glu lie Ser Thr Val lie. Val 
50 55 60 



Phe Glu Asn His Arg Glu Lys Glu lie Ala Val Arg Val Leu Glu Leu 
65 70 75 80 



Met Gly Ala Lys Val Arg Tyr Val Tyr His lie lie Pro Ala lie Ala 
85 90 95 



Ala Asp Leu Lys Val Arg Asp Leu Leu Val lie Ser Gly Leu Thr Gly 
100 105 ~ 110 
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Gly Lys Ala Lys Leu Ser. Gly Val Arg Phe lie Gin -Glu Asp Tyr Lys 
115 120 125 



Val Thr Val Ser Ala Glu Leu Glu Gly Leu Asp Glu Ser Ala Ala Gin 
130 135 140 



Val Met Ala Thr Tyr Val Trp Asn Leu Gly Tyr Asp Gly Ser Gly lie 
145 150 155 160 



Thr lie Gly lie lie Asp Thr Gly lie Asp Ala Ser His Pro Asp Leu 
165 170 175 



Gin Gly Lys Val lie Gly Trp Val Asp Phe Val Asn Gly Arg Ser Tyr 
180 185 190 



Pro Tyr Asp Asp His Gly His Gly Thr His Val Ala Ser lie Ala Ala 
195 200 205 



Gly Thr Gly Ala Ala Ser Asn Gly Lys Tyr Lys Gly Met Ala Pro Gly 
210 215 220 



Ala Lys Leu Ala Gly lie Lys Val Leu Gly Ala Asp Gly Ser Gly Ser 
225 230 235 240 



lie Ser Thr lie lie Lys Gly Val Glu Trp Ala Val Asp Asn Lys Asp 
245 250 255 



Lys Tyr Gly lie Lys Val lie Asn Leu Ser Leu Gly Ser Ser Gin Ser 
260 265 270 



Ser Asp Gly Thr Asp Ala Leu Ser Gin Ala Val Asn Ala Ala Trp Asp 
275 280 285 



Ala <STy Teu" Val VaJ~^7alT VaT~ ~ATa~ Al a GTy ~Ksn~Se r ' *Giy7r'6~ As n Lys 
290 295 300 



Tyr Thr He Gly Ser Pro Ala Ala Ala Ser Lys Val He Thr Val Gly 
305 310 315 320 



Ala Val Asp Lys Tyr Asp Val He Thr Ser Phe Ser Ser Arg Gly Pro 
325 330 335 
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Thr Ala Asp Gly Arg Leu Lys Pro Glu Val Val Ala Pro Gly Asn Trp 
340 345 350 



lie He Ala Ala Arg Ala Ser Gly Thr Ser Met Gly Gin Pro He Asn 
355 360 365 



Asp Tyr Tyr Thr Ala Ala Pro Gly Thr Ser Met Ala Thr Pro His. Val 
370 • 375 380 



Ala Gly He Ala Ala Leu Leu Leu Gin Ala His Pro Ser Trp Thr Pro 
385 390 395 400 



Asp Lys Val Lys Thr Ala Leu He Glu Thr Ala Asp He Val Lys Pro 
405 410 " 415 



Asp Glu He Ala Asp He Ala Tyr Gly Ala Gly Arg Val Asn Ala Tyr 
420 425 430 



Lys Ala He Asn Tyr Asp Asn Tyr Ala Lys Leu Val Phe Thr Gly Tyr 
435 440 445 



Val Ala Asn Lys Gly Ser Gin Thr His Gin Phe Val lie Ser Gly Ala 
450 455 460 



Ser Phe Val Thr Ala Thr Leu Tyr Trp Asp Asn Ala Asn Ser Asp Leu 
465 470 475 480 



Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Gin Val Asp Tyr Ser Tyr 
485 490 * ^ 495 



Thr Ala Tyr Tyr Gly Phe Glu Lys Val Gly Tyr Tyr Asn Pro Thr Asp 
500 505 " 510 



Gly Thr Trp Thr He Lys Val Val Ser Tyr Ser Gly Ser Ala Asn Tyr 
515 520 525 



Gin Val Asp Val Val Ser Asp Gly Ser. Leu Ser Gin Pro Gly Ser Ser 
530 535 540 



Pro Ser Pro Gin Pro Glu Pro Thr Val Asp Ala Lys Thr Phe Gin Gly 
545 550 555 " 560 



Ser Asp His Tyr Tyr Tyr Asp Arg Ser Asp Thr Phe Thr Met Thr Val 
565 570 575 
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Asn Ser Gly Ala Thr Lys lie Thr Gly Asp Leu Val Phe Asp Thr Ser 
580 585 590 



Tyr His Asp Leu Asp Leu Tyr Leu Tyr Asp Pro Asn Gin Lys Leu Val 
595 600 * 605 



Asp Arg Ser Glu Ser Pro Asn Ser Tyr Glu His Val Glu Tyr Leu* Thr 
610 615 620 



Pro Ala Pro Gly Thr Trp Tyr Phe Leu Val Tyr Ala Tyr Tyr Thr Tyr 

635 640 



650 



625 


630 


Gly Trp Ala Tyr Tyr Glu Leu 




645 


<210> 


36 


<211> 


25 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic 


<400> 


36 


tctgaattcg ttcttttctg tatgg 


<210> 


37 


<211> 


20 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic 


<400> 


37 


tgtactgctg gatccggcag 




Jb 


<211> 


80 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic 


<400> 


38 



25 



20 



ggatccatca gatttttgag tgtagatcaa ccagtatgct gcatttgtaa ttgtgagata 60 
atatctcccg cgggtaaggt 80 



- 36 - 





<210> 39 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 39 

agaggcatgc gtatccatca gatttttgag 30 

<210> 40 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<210> 41 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 41 

gttccaagta tccgttcact 20 



<210> 42 
<211> 12 
<212> PRT 

<213> Pyrococcus furiosus 
<400> 42 

Ala Glu Leu Glu Gly Leu Asp Glu Ser Ala Ala Gin 
15 10 



<210> 43 

<2il> 4 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 40 

agtgaacgga tacttggaac 



20 • 



<220> 
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<221> misc_feature 

<222> (1)..(1) 

<223> Leu is modified by a succinyl group. 
<220> 

<221> misc_f eature 

<222> (4) . . (4) 

<223> Tyr is modified by a 7-amino-4-methylcoumarin group. 

<400> 43 

Leu Leu Val Tyr 
1 



<210> 44 

<211> 4 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 



<220> 

<221> mi sc_f eature 

<222> (1) . . (1) 

<223> Ala is modified by a succinyl group. 
<220> 

<221> misc_f eature 

<222> (4) . . (4) 

<223> Phe is modified by a p-nitroaniline group. 

<400> 44 

Ala Ala Pro Phe 
1 



<210> 45 

<211> 382 

<212> PRT 

<213> subtilisin 



<400> 45 



Met Arg Gly Lys Lys Val Trp lie Ser Leu Leu Phe Ala Leu Ala Leu 
15 10 15 



lie Phe Thr Met Ala Phe Gly Ser Thr Ser Ser Ala Gin Ala Ala Gly 
20 25 30 



Lys Ser Asn Gly Glu Lys Lys Tyr lie Val Gly Phe Lys Gin Thr Met 
35 40 45 



- 38 - 



Ser Thr Met Ser Ala Ala Lys Lys Lys Asp Val lie Ser Glu Lys Gly 
50 55 60 



Gly Lys Val Gin Lys Gin Phe Lys Tyr Val Asp Ala Ala Ser Ala Thr 
65 70 " 75 80 



Leu Asn Glu Lys Ala Val Lys Glu Leu Lys Lys Asp Pro Ser Val Ala 
85 90 95 



Tyr Val Glu Glu Asp His Val Ala His Ala Tyr Ala Gin Ser Val Pro 
100 105 110 



Tyr Gly Val Ser Gin lie Lys Ala Pro Ala Leu His Ser Gin Gly Tyr 
115 120 125 



Thr Gly Ser Asn Val Lys Val Ala Val lie Asp Ser Gly lie Asp Ser 
130 135 140 



Ser His Pro Asp Leu Lys Val Ala Gly Gly Ala Ser Met Val Pro Ser 
145 150 155 160 



Glu Thr Asn Pro Phe Gin Asp Asn Asn Ser His Gly Thr His Val Ala 
165 170 175 



Gly Thr Val Ala Ala Leu Asn Asn Ser lie Gly Val Leu Gly Val Ala 
180 185 190 



Pro Ser Ala Ser Leu Tyr Ala Val Lys Val Leu Gly Ala Asp Gly Ser 
195 200 . 205 



Gly Gin Tyr Ser Trp lie He Asn Gly He Glu Trp Ala He Ala Asn 
210 215 220 



Asn Met Asp Val He Asn Met Ser Leu Gly Gly Pro Ser Gly Ser. Ala 
-2T5 230 ■ "233 "2*0 



Ala Leu Lys Ala Ala Val Asp Lys Ala Val Ala Ser Gly Val Val Val 
245 250 255 



Val Ala Ala Ala Gly Asn Glu Gly Thr Ser Gly Ser Ser Ser Thr Val 
260 265 270 



Gly Tyr Pro Gly Lys Tyr Pro Ser Val He Ala Val Gly Ala Val Asp 
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275 280 285 



Ser Ser Asn Gin Arg Ala Ser Phe Ser Ser Val Gly Pro Glu Leu Asp 
290 295 300 



Val Met Ala Pro Gly Val Ser lie Gin Ser Thr Leu Pro Gly Asn Lys 
305 310 315 " 320 



Tyr Gly Ala Tyr Asn Gly Thr Ser Met Ala Ser Pro His( Val Ala Gly 
325 330 335 



Ala Ala Ala Leu lie Leu Ser Lys His Pro Asn Trp Thr Asn Thr Gin 
340 " 345 350 



Val Arg Ser Ser Leu Glu Asn Thr Thr Thr Lys Leu Gly Asp Ser Phe 
355 360 ^ 365 



Tyr Tyr Gly Lys Gly Leu lie Asn Val Gin Ala Ala Ala Gin 
370 375 380 
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